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■ Abstract 

o ■ 

' We consider the anti- ferromagnetic Potts model on the the integer lattice I?. The 

model has two parameters, q, the number of spins, and A ~ exp(— /3), where (3 is "inverse 

' ' temperature" . It is known that the model has strong spatial mixing if g > 7, or if g = 7 

. and A = or A > 1/8, or if g = 6 and A = or A > 1/4. The A = case corresponds to 
the model in which configurations arc proper g-colourings of . We show that the system 

^ . has strong spatial mixing for q > 6 and any A. This implies that Glauber dynamics is 

^ ' rapidly mixing (so there is a fully-polynomial randomised approximation scheme for the 

• i-H . partition function) and also that there is a unique infinite- volume Gibbs state. We also 

' show that strong spatial mixing occurs for a larger range of A than was previously known 

^ . for (7 = 3, 4 and 5. 



1 Introduction and statement of results 
1.1 The anti-ferromagnetic Potts model 

We consider the anti- ferromagnetic Potts model on the integer lattice I?. The set of spins is 
Q = {1, . . . ,q}. Configurations are assignments of spins to vertices, and $7 = Q^^ is the set 
of all configurations. A region R is a (not necessarily connected) subset of vertices, and cr/j 
denotes the restriction of configuration a to R. O/j = is the set of all such restrictions. If 
R is a finite region, then its vertex boundary, dR, is the set of vertices that are not in R, but 
are adjacent to R. A boundary configuration on dR is a function from dR to the set {0} U Q. 
The spin "0" corresponds to a "free boundary" which does not influence the vertices in R. 
Let E{R) denote the set of lattice edges that have at least one vertex in R. Given a region R 

*This work was partially supported by the EPSRC grant "Discontinuous Behaviour in the Complexity of 
Randomized Algorithms" . 
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and a boundary configuration B on dR, the energy of the configuration an G 0,r is given by 
the Hamiltonian 



The partition function Z = "^^^q^ exp{—H{a)). The finite-volume Gibbs measure vrg is the 
distribution on in which, for every a & ■7re((T) = exp{— H (a)) / Z . Letting mono-{E{R)) 
denote the number of monochromatic edges in E{R) and taking A = exp(— it is apparent 
that 7rB(a) is proportional to A™°"'^('^(-^)). 

In the zero-temperature case /? = cxo, A = and vrg is the uniform distribution on "proper" 
colourings, which are configurations without monochromatic edges. In this paper we will focus 
on the situation in which the temperature is non-zero, so A € (0, 1]. 

For any A C i2, tt^^a denotes the distribution on configurations of induced by vrg. 

1.2 Strong spatial mixing 

If the parameters q and A are chosen appropriately, then the anti-ferromagnetic Potts model 
has strong spatial mixing. Informally, this means that for any finite region R, if you consider 
two different boundary configurations B and B' on dR which differ at a single vertex y then 
the effect that this difference has on a subset A <^ R decays exponentially with the distance 
from A to y. The formal definition below is taken from [2j but adapted to the special case of 
the anti- ferromagnetic Potts model on I?. 

Definition 1 The anti-ferromagnetic Potts model on 1? has strong spatial mixing for param- 
eters A and q if there are constants rj and rj' > such that, for any non-empty finite region R, 
any A C R, any vertex y G dR, and any pair of boundary configurations (i3, B') of dR which 
differ only at y, 



where d{y, A) is the lattice distance within R from the vertex y to the region A and dxv 
denotes total variation distance. 

We assume that y is not a free-boundary vertex in either configuration. That is, By £ Q 
and By £ Q. 

Strong spatial mixing is an important property because of two, related, consequences. 
First, strong spatial mixing implies that there is a unique infinite-volume Gibbs measure on 
configurations in 0. Qualitatively, there is one equilibrium, not many. Second, strong spatial 
mixing implies that Glauber dynamics can be used to efficiently sample configurations from ttb 
(for any finite region R and boundary configuration B on dR). We describe both of these 
consequences below before stating our results. 



(ij)6i?(K) 

where /? G M is the "inverse temperature" and 




dTv(7rB,A,7rf5',A) < ?/|A|exp(-r7'd(y,A)), 
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1.3 Uniqueness 



A measure /i on $7 is an infinite-volume Gibbs measure if, for any finite region R and any 
configuration a, the conditional probability distribution | a-^) (conditioned on the config- 
uration on all vertices other than those in R) is T^aaR ■ It is known that there is at least one 
infinite-volume Gibbs measure corresponding to any choice of the parameters q and A. An 
important problem in statistical physics is determining for which parameters this is unique. 
Strong spatial mixing implies that there is a unique infinite-volume Gibbs measure jl5l I19j 
with exponentially decaying correlations. 

1.4 Rapid mixing 

Suppose that i? is a finite region of and that ;S is a boundary configuration on dR. We will 
consider the (heat-bath) Glauber dynamics for sampling from vrg. This is a Markov chain M 
with state space A transition is made from a configuration a £ ^l^hy choosing a vertex v 
uniformly at random from R, "erasing" the spin at vertex v and then choosing a new spin for 
vertex v from the conditional distribution, given cr/j_{i,} and B. Here is a detailed description 
of the transition. 

One step of the (heat-bath) Glauber dynamics Markov chain M: 

1. Choose a vertex v uniformly at random from R. 

2. For i £ Q, let rij denote the number of neighbours of v which are assigned spin i (either 
in cr or in B). 

3. Choose a new spin c according to the distribution 



for i £ Q. 

4. Obtain the new configuration a' from a by assigning spin c to vertex v. 

It is known (for example, see [3]) that A4 is ergodic, with unique stationary distribution 
vrg^. It is also known that if the Potts model has strong spatial mixing (which is true for 
appropriate choices of q and A, as we will see below) then M. is rapidly mixing. 

Before describing what is known about rapid mixing, we recall the definitions. Let P 
denote the transition matrix of A^, and let P*((T, a') be the t-step probability of moving from a 
to a'. For 5 > 0, the mixing time is defined as rx(5) = min{to : dtv{P* , t^b) < ^ for all t > to}. 
M is said to be rapidly mixing if t_m{6) is at most a polynomial in n and log(l/(5), where n 
is the number of vertices in R. 

^It is easy to verify that M is ergodic for the positive temperature case A £ (0, 1] considered in this paper. 
Ergodicity is much more subtle in the zero-temperature case A = 0. Here is an example that is not ergodic 
with A = and g = 5. The region R consists of two adjacent vertices u and v. The boundary configuration 
B assigns colours 3, 4 and 5 to the neighbours of u and the same colours to the neighbours of v. Now M is 
not ergodic since it cannot move between the configuration [u, v) = (1, 2) and the configuration (it, v) — (2, 1). 
However, the chain is ergodic if g > 6 (the maximum degree plus two) and it is ergodic if g > 3 if the boundary 
configuration is chosen appropriately (for example, the free boundary case). See, for example, the ergodicity 
proofs in pilTI]. 
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It is well-known that strong spatial mixing implies rapid mixing in our setting. The 
difficulty of the proof depends upon the precise bound on t_m{6) that is obtained. Dyer, 
Sinclair, Vigoda and Weitz Theorem 2.5] give a nice simple combinatorial proof that strong 
spatial mixing implies that a certain "heat-bath block dynamics" mixes in 0(n log (n/5)) 
time. Markov-chain comparison can now be applied in a standard way to show that Glauber 
dynamics mixes in 0{n(n + log{l/6))) time (see, for example. Section 7 of or (for a slightly 
larger bound) 0). In fact, it is known that strong spatial mixing implies 0{nlog{n/5)) mixing 
of Glauber dynamics, giving a small improvement on the mixing-time bound. As explained 
m 0, this can be proved using techniques from functional analysis [3l 1151 ITHl I18j. The idea 
is to bound the log-Sobolev constant of the block dynamics, and translate this bound into a 
bound on the log-Sobolev constant of Glauber dynamics. 



1.5 Approximating the partition function 

We have seen in Sectional] that when the Potts model has strong spatial mixing, the Markov 
chain M, which corresponds to heat-bath Glauber dynamics, is rapidly mixing. Thus, there 
is an efficient algorithm for sampling from the Gibbs distribution vrg. 

Before stating our results in Section 11.61 we mention one consequence of rapid mixing. 
A randomised approximation scheme is an algorithm for approximately computing the value 
of a function /. The approximation scheme has a parameter e > which specifies the error 
tolerance. For concreteness, suppose that / is a function from S* to M. For example, for fixed 
values of q and A, / might map an encoding of a region R and a boundary configuration B to 
the value of the partition function Z corresponding to R and B. A randomised approximation 
scheme for / is a randomised algorithm that takes as input an instance x £ T,* (e.g., R and 
B) and an error tolerance e > 0, and outputs a number z G Q (a random variable of the "coin 
tosses" made by the algorithm) such that, for every instance x, 



Pr 



(^^<z<il + e)fix) 



>-l 



The randomised approximation scheme is said to be a fully polynomial randomised approxi- 
mation scheme, or FPRAS, if it runs in time bounded by a polynomial in |x| and e~^. Note 
that the quantity 3/4 in Equation could be changed to any value in the open interval 
(^, 1) without changing the set of problems that have randomised approximation schemes. 

Using ideas of Jerrum, Valiant and Vazirani an efficient sampling algorithm for vrg 
can be turned into an FPRAS for the partition function. A straightforward proof is based on 
Dyer and Greenhill's extension of jl3j . 

In summary, if q and A are chosen so that the Potts model has strong spatial mixing then 
Ai is rapid mixing. This, in turn, gives an FPRAS for the partition function. 



1.6 Context and statement of results 

For q = 2 (see |T5j ) it is known that there is a critical point Ac such that uniqueness (and 
strong spatial mixing) hold for A > Ac but there are two Gibbs measures for A < Ac (in one 
of these Gibbs measures, spin 1 is favoured at "even-parity" vertices, and in the other, spin 2 
is favoured). The value of Ac (see dZI) is Ac = \/2 - 1 ~ 0.41. 

Thus, we investigate the case q > 2. It is believed jTT] that there is strong spatial mixing 
for every A € (0, 1] for q = 3 and for every A € [0, 1] for q > 3. The point g = 3, A = is 
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excluded because, on physical grounds, this is believed to be a critical point. It is believed 
that at this point there is a unique infinite-volume Gibbs measure but that the correlations 
only decay algebraically (e.g., polynomially) . Salas and Sokal used Dobrushin uniqueness to 
show that that strong spatial mixing occurs for every A G [0, 1] for q > 8. As Jerrum points 
out Section 5], Salas and Sokal's calculation applies whenever q > 8(1 — A), so it also 
applies to positive A for smaller q. The result applies to a more general context than the one 
studied in this paper — it applies to the anti-ferromagnetic Potts model on any infinite graph. 
The generalised condition is q > 2A(1 — A), where A is the maximum degree. Jerrum jllj 
considered the A = case and showed rapid mixing (in fact, 0{n log{n/5)) mixing) for Glauber 
dynamics when q > 2A (in fact, he considered a slightly different version of Glauber dynamics, 
but the difference is not important here). Jerrum's result implies Salas and Sokal's for A = 
since 0{nlog{n/6)) mixing of Glauber dynamics implies strong spatial mixing Theorem 
2.3]. 

The results that we have just discussed give strong spatial mixing for A = and g > 8. 
In fact, better results are known for A = 0. Salas and Sokal used decimation to prove 
strong spatial mixing for q > 7. This is a machine-assisted proof. The q = 7 case is also 
implied by the work of Bubley, Dyer, Greenhill and Jerrum They gave a machine- assisted 
proof of 0{nlog{n/6)) mixing for a block dynamics on 4-regular triangle-free graphs. As we 
mentioned above, this implies 0(nlog(n/5)) mixing for Glauber dynamics, which, in turn, 
implies strong spatial mixing. A proof without machine assistance of strong spatial mixing 
for q > 7 is given by Goldberg, Martin and Paterson |H1 Theorem 5]. Once again, the 
result applies more generally — in this case to triangle-free graphs with maximum degree at 
most A > 3 where q > 1.76 A - 0.47. 

Achlioptas et al. gave a machine-assisted proof of strong spatial mixing for A = and 
q = 6. Their method was to prove 0(n log(n/5)) mixing for a block dynamics, which implies 
spatial mixing as discussed above. 

It is known that Glauber dynamics is rapidly mixing on rectangular regions when q = 3 
and A = 0. This is proved in the fixed-boundary case by Luby, Randall, and Sinclair and 
in the free-boundary case by Goldberg, Martin and Paterson [Jj. The (polynomial) mixing- 
time bounds are not 0{nlog{n/6)). Indeed, as mentioned above, it is not believed that strong 
spatial mixing holds for A = and q = 3. 

The following proposition summarises the results that we have just discussed. 

Proposition 1 Consider the anti-ferromagnetic Potts model on I? with parameters q and 
A < 1. There is strong spatial mixing in the following cases. 





2 > 8 


and 


A > 


0, 






(ii) ( 


? = 7 


and 


A = 


or 


A > 1/8 = 


0.125 


(Hi) c 


7 = 6 


and 


A = 


or 


A > 2/8 = 


0.25, 


(iv) c 


7 = 5 


and 


A > 


3/8 : 


= 0.375, 




H ' 


7 = 4 


and 


A > 


4/8 : 


= 0.5, and 




(vi) ( 


2 = 3 


and 


A> 


5/8 : 


= 0.625. 





Thus, in these cases, Glauber dynamics is rapidly mixing and there is a unique infinite-volume 
Gibbs measure. 
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The purpose of this work is to improve the results in Proposition^ Our main objective 
was to extend the q = 6 and q = 7 results for A = to all temperatures. We state our results as 
two theorems to separate the results that are proved without machine assistance (Theorem|2I) 
from those that are proved with machine assistance. Theorem |21 subsumes Theorem |2 

Theorem 2 Consider the anti-ferromagnetic Potts model on 1? with parameters q and A < 1. 
There is strong spatial mixing in the following cases. 

(i) q ^ 7 and A > 0, 

(ii) q = 6 and X = or X> 1/7 ^ 0.1429, 
(Hi) q = f> and \ > 2/7 ^ 0.2857, 

(iv) q = A and X> ^(v^ - 5) w 0.3723, and 

(v) (7 = 3 and A > Aq, where Aq ~ 0.4735 is the real solution o/ A^ + 4A — 2 = 0. 

Thus, in these cases, Glauber dynamics is rapidly mixing and there is a unique infinite-volume 
Gibbs measure. 

Theorem 3 Consider the anti-ferromagnetic Potts model on with parameters q and A < 1. 
There is strong spatial mixing in the following cases. 

(i) q > Q and A > 0, 

(ii) q = 5 and X> 0.127, 
(Hi) q = A and A > 0.262, and 
(iv) (7 = 3 and A > 0.393. 

The bounds for q = q = A and q = 2> can be improved further by more extensive machine 
calculation. These results will appear in the PhD thesis of one of the authors jlOj . 

1.7 The anti- ferromagnetic Potts model on general graphs 

In this paper we consider the anti- ferromagnetic Potts model on the integer lattice 1?. One 
reason for restricting attention to 1? is that it is a natural lattice, of interest in statistical 
physics ■ Another reason is that the model is known not to have good mixing properties on 
a general graph. As Welsh observes |20l 3.7.12], the partition function of the Potts model is a 
specialisation of the Tutte Polynomial along the hyperbola Hq = {(x, y) : {x — l){y — 1) = q}- 
The anti-ferromagnetic Potts model (for real temperatures) corresponds to the additional 
constraint < A < 1, which corresponds to a portion of the hyperbola in which x — 1 and 
y — 1 are negative. There is no FPRAS for the Tutte polynomial along this hyperbola unless 
NP=RP 1201 8.7.2]. 

Jerrum and Sinclair |T2] considered the anti-ferromagnetic Ising model, which corresponds 
to the Potts model with q = 2. They used a reduction from MaxCut (the problem of 
counting cut-sets of a specified size in a graph) to show that there is no FPRAS for the 
partition function unless NP=RP. Their proof applies for a particular value of A, but the 
stretching and thickening technique of Jaeger, Vertigan and Welsh [2] can be used to show 
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that there is no FPRAS for any fixed A (see (HI)- Welsh has shown that the same result holds 
for any q >3 j2Ul 8.7.2]. Thus, unless NP=RP, the anti-ferromagnetic model does not exhibit 
strong spatial mixing on a general graph. In this paper, we do not consider a general graph. 
Instead we consider the integer lattice Z^. 

2 Recursive coupling 

2.1 The recursive coupling tree 

The essence of proving strong spatial mixing is showing that, if you take an arbitrary region R 
and boundary configurations B and B' on dR that disagree on a single boundary vertex y, 
then there is a coupling of tt^ and ttq' in which the probability of disagreement at a vertex 
decays exponentially with its distance from y. We will construct such a coupling using the 
recursive method of Goldberg, Martin and Paterson |S]. We start by describing the method. 

Let i? be a non-empty finite region. As in [Sj, we will find it convenient to work with 
the edge-boundary of R rather than with the boundary dR of vertices surrounding R. Here 
is the notation that we will use. The boundary of the region R is the collection of edges 
that have exactly one endpoint in R. A boundary configuration B is a function from the set 
of edges in the boundary to the set {0} U Q. Given a configuration a S ^Ir, the quantity 
mon(j{E{R)) is the number of monochromatic edges in E{R), where a boundary edge is said 
to be "monochromatic" if its spin is the same as the spin that is assigned by a to its endpoint. 
ttb is the Gibbs distribution in which the probability of a is proportional to x^o'^Ti^iR)) _ "Wg 
will be interested in studying how much ttb varies when we change the spin of a single edge 
of B. This small change to the boundary is formalised by the following notation. 

Definition 2 A boundary pair"^ X consists of 

• a non-empty finite region Rx, 

• a distinguished boundary edge sx = {wxifx) with fx G Rx, o.nd 

• a pair {Bx , B'^ ) of boundary configurations which differ only on the edge sx ■ 
We require 

• Bx{sx) £ Q, and 

• B'-^{sx) G Q, and 

• any two perpendicular boundary edges that share a vertex f G dRx have the same spin 
in at least one of the two configurations Bx and B'-^ . 

A coupling ^ of t^Bx vr^^ is a distribution on ^ which has marginal dis- 

tributions 'kbx ^-^d T^B'^- For such a coupling we define l^rj to be the indicator random 

^In the paper this was referred to as a "relevant boundary pair". The reason for the terminology is 
that paper |H| also used the notion of a boundary pair in which the final condition above (the one about 
perpendicular boundary edges) is dropped. Note that this condition depends upon the geometry of the lattice. 
In this paper we always work on the lattice 1? and we always include all conditions listed above so we drop 
the word "relevant" to simplify terminology. 
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variable for the event that, when a pair of configurations is drawn from ^, the spin of / dif- 
fers in these two configurations. For any boundary pair X we define to be some coupling 
of ttbx and tt^/^ minimizing E[l^j-^]. For every pair of spins c and c' , let px{c,c') be the 
probability that, when a pair of configurations (C, C) is drawn from ^x, fx has spin c in C 
and spin c' in C . 

We define a labelled tree Tx associated with each boundary pair X. We will use the tree 
to get an upper bound on the expected number of disagreements at any distance from wx in 
a coupling of ttbx and t^b'^- 

The tree Tx is constructed as follows. Start with a vertex r which will be the root of Tx- 
For every pair of spins c G Q and c' G Q, c ^ c' , add an edge labelled {px{c, c'), fx) from r 
to a new node rc,c'- If fx has no neighbours in Rx then Tc^c' is a leaf. Otherwise, for some 
k G {1, 2, 3}, let ei, . . . , Cfc be the edges from fx to nodes in Rx- If A: = 3 order these edges 
so that ei and 63 are not perpendicular. For each i G {1, . . . , k}, let Xi{c, c') be the boundary 
pair consisting of 

• the region Rx — fx] 

• the distinguished edge e^; 

• the boundary configuration B of Rx — fx that 

— agrees with Bx on common edges, 

— assigns spin c' to ei, . . . , ej_i, and 

— assigns spin c to e^, . . . , 6^; and 

• the boundary configuration B' that agrees with B except that it assigns spin c' to ej. 

Recursively construct Tx,(c,c')' the tree corresponding to boundary pair Xi{c,c')- Add an 
edge with label (1, •) from r^^c' to the root of Tx-[c,c')- That completes the construction of Tx- 
We say that an edge e of Tx is degenerate if the second component of its label is "•" . For 
edges e and e' of Tx , we write e — > e' to denote the fact that e is an ancestor of e'. That is, 
either e = e', or e is a proper ancestor of e'. Define the level of edge e to be the number of 
non-degenerate edges on the path from the root down to, and including, e. Suppose that e is 
an edge of Tx with label (p, /). We say that the weight w{e) of edge e is p. Also the name 
n(e) of edge e is /. The likelihood £{e) of e is He' e'^e '"^(^)- '^^^ ^'^^^ lif^Tx) of a vertex / 
in Tx is Z^e n(e)=/ ^(^)- 1^°^ d > 1., let Ed{X) denote the set of level-d edges in Tx- Let 
T(i{X) = Ylie&Ei(X) ^(^)- following lemma, from 

Lemma 4 Consider the anti- ferromagnetic Potts model on 1? with parameters q and A. 
Suppose there is an e > such that, for every boundary pair X and every d > 1, Td^X) < 
[1 — e)'^. Then the system has strong spatial mixing. 

Proof. The relevance of Tx for providing an upper bound on the quality of the coupling is 
established in Lemma 12 of |H1, which shows that there is a coupling \I' of ttbx and vr^^ such 
that, for all / G Rx, j] < 7(/, Tx) which is at most Ylid>d(f s^) ^di^)-, where d{f, sx) is 
the lattice distance from / to sx- (Thus, d{fx, Sx) = 1 and if / G Rx is adjacent to fx then 
d{f, sx) = 2 and so on.) Following the proof of [SJ Lemma 33], we find that 
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and 

f&Rx 

Following the proof of |S1 Lemma 34], we obtain similar conclusions, assuming that we start 
with a pair of boundary configurations on the boundary dR of vertices surrounding R, such 
that the pair differs only on a particular vertex vx- In particular, there is a coupling ^ such 
that 

6 - ^d{f,vx) 



and 

This implies strong spatial mixing |H1 Corollary 21]. □ 

2.2 Bounding the cost of level-rf edges in the recursive coupling tree 

A key ingredient from the construction of Tx which affects 7(/, Tx) is the quantity E[l^,-^ j^], 
which we denote i^{X). Thus, iy{X) = min^ E[lijf j^], where the minimum is over all cou- 
plings ^ of ttbx and T^B'^- 

In order to get good upper bounds on viX), Goldberg, Martin and Paterson [Hj observed 
that i^iX) can be upper-bounded in terms of corresponding values for boundary pairs with 
smaller regions. They used the following lemma. 

Lemma 5 IB/ Suppose A = 0. Suppose that X is a boundary pair. Let R' be any subset of 
Rx which includes fx- Let x be the set of boundary pairs X' = (Rx' , sx' , Bx' , B'j^,) such 
that Rx' = R' , sx' = sx, Bx' agrees with Bx on common edges, and B'^, agrees with B'^ 
on common edges. Then ^{X) < maxx'ex '^("'^')■ 
Figure^is an illustration of how Lemma El is used to find an upper bound on lyiX). The 
basic idea is to pick a small subregion R' that contains the vertex fx ■ Compute the maximum 
value of v for that subregion, where we maximise over boundary configurations of R' that 
agree with the boundary configurations of Rx on the common overlap of these boundaries. 
This maximum value is an upper bound for i'{X). 





Wx I Wx, 

Figure 1: The application of Lemma |S1 



An interesting feature of the positive-temperature Potts model is that this approach does 
not work. In particular, Lemma [5] does not apply to positive A. For example, suppose q = 2 
and A = 1/2. Consider a region Rx containing fx and one of its neighbours, y, as illustrated 
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below. (In this diagram and all subsequent diagrams we will denote vertices as squares so 
that we have space to label them.) In the diagram, sx is the edge between fx and its lower 
neighbour (which is not pictured). The edge sx is assigned spins 1 and 2 by the two boundary 
configurations Bx and B'^. The rest of the boundary configurations are as shown (assigning 
spins 1, 1, 1, 2 and 2 clockwise around the picture). A calculation shows that i^(X) = 30/91. 
However, if R' is chosen to be the region containing fx only then the corresponding boundary 

pairs X' (depicted to the right) both have iy{X') = 30/100 < 30/91. 

1 

-I- 



y 



1 4 /x f 2 

— I— 

1/2 



4- 



1 4 /x f 2 

— I- 

1/2 



1 - fx - 2 



1/2 



i^{X) = 30/91 



u{X') = 30/100 i^{X') = 30/100 



Our approach is to find an upper bound, fJ,{X), for I'iX) such that fi{X) can be upper- 
bounded using smaller regions along the lines of Lemma[51 Let X be a boundary pair. Recall 
that E[Rx) is the set of lattice edges with at least one endpoint in Rx- For any subset 
E C E{Rx) — {sx} and any configuration a G ^Rx^ m.oiifj{E) denote the number of 
monochromatic edges in E, where a boundary edge is considered to be monochromatic if its 
spin in Bx is the same as the spin assigned by a to its endpoint. For i € Q, let Jlj be the set 
of configurations in that assign spin i to vertex fx- Let q be the total weight of these 
configurations, ignoring edge sx- 



j^mon^{E{Rx)~{sx}) 

Let C contain the two spins assigned to sx by the boundary configurations. That is, C = 
{Bxisx), B'j^{sx)} and let c = X^j^c'Cj. We now define 

(1 - A)q 

niX) = max — . 

^ ^ iec (1 + X)ci +c 

The following lemma enables us to use fi{X) to find upper bounds for z^(X). The intuition 
behind the lemma is best understood from Equations ((2) and ©• Informally, ((21) says that 
the disagreement probability z^(X) is at most the difference between the probability of seeing 
a certain colour in one distribution (with one boundary configuration) and the probability of 
seeing the same colour in the other distribution. A little manipulation gives Equation 
which shows that this quantity is at most n{X). The remainder of the argument shows that 
fi{X) can be upper bounded using smaller regions.^ 

^To see that it is plausible that ij.{X) can be upper bounded using smaller regions, consider the boundary 
configuration B which is the same as the boundary configurations in X except that B{sx) = so sx is a free 
boundary edge. Note that in the expression 

(1 - X)c, _ 1 - A 



(1 + A)c, + c 1 + A+^' 

from the definition of fJ,{X), c/d is the ratio of Prir^ {fx C) to Pr^^ {fx ~ «)• By convexity, this ratio can be 
bounded by considering smaller regions (see the proof for details). Of course, the convexity argument allows 
some flexibility in the exact definition of fJ,{X) and the best thing is to define jJ.{X) so that it is as small as 
possible, subject to the constraint viX) < j-i{X). . 
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Lemma 6 Suppose that X is a boundary pair. Let R' be any subset of Rx which includes 
fx- Let X be the set of boundary pairs X' = {Rx', sx', Bx', B'^^,) such that Rx' = R\ sx' = 
sx,Bx' agrees with Bx on common edges, and B'-^, agrees with B'-^ on common edges. Then 
I'iX) < maxx'g^ /i(X'). 

Proof. Without loss of generality (to simplify notation) suppose Bx{sx) = 1, B'^{sx) = 2, 
and ci > C2. We will show 

(i) iy{X) < fi{X) and 

(ii) fi{X) < maxx'gx/^(^')- 



First, we show (i) Note that 



Aci 



Prn^Jfx = i) = { ^^^tr^^' 



Pr.,, (fx = i) 



Aci+C2+c' 
ci+Ac2+c' 



i = l; 
2<i<q, 

i = 2- 

i = 1,3 < i < q. 



(_ ci+Ac2+c' 

Since ci > C2 and A < 1, the denominator in the expression for Pr^ , {fx = i) exceeds the 

denominator in Pij^g^ (fx = i) so we can couple t^Bx ^"^^ ^^'x ^^^h a way that disagreement 
at fx occurs only when the sample from vr^/^ assigns spin 1 to fx- Thus, 

K^) < Pr^^^(/x = l)-Pr^,^(/x = l) (2) 
ci Aci 



Cl + AC2 + C Aci + C2 + c 

C1(1-A)(C2 + AC2 + C) 

(C2 + Aci + c) (ci + AC2 + c) 

^ ^Mx). (3) 

(1 + Ajci + c 

For |(ii)[ let W = Rx — R' ■ For i ^ Q and p G Vlw let ilj^p be the set of configurations 
fj G with CFfx=i a-iid fivi/ = /). Let 

g.^^ ^ )rnon^(E{Rx)-{sx}) ^ 

let Cp = max(ci^p, C2,p) and let Cp = Yli=3 ^i,p- Then 

( (l-A)ci (1-A)C2 \ 1-A 1-A 

/x(A j = max 



< 



(l + A)ci+c' (l + A)c2 + c; 1 + A + ^ z.,,o^^P 
1-A _ (1 - A) EpGt^^ Cp _ Epenw(l-^)^p 



1 + A + 



Spgny^ (1 + A) EpeCvv- + EpeCw EpGnw((l + ^)cp + ^p) 



(l-A)cp / / (1-A)ci,p (1-A)c2,p 

< max — - — - — = max max ' 



pe^w (1 + A)cp + Cp p&nw \ V (1 + A)ci,p + Cp ' (1 + A)c2,p + c 
max IJ,{X'), 



11 



where X' is the boundary pair in x which Bx' and B'j^, are induced by p. Note that X' is 
a boundary pair — in particular, it satisfies the condition about perpendicular edges. The last 
step follows from the observation that ci^p, C2,p and Cp all contain the factor \™'^'^<t{e{Rx)-e{Rxi)) ^ 
which is constant for a fixed p, and can be cancelled out to obtain p{X'). □ 

3 Proof of Theorem [2] 

We start with a lemma, which we will use to obtain upper bounds on /u(X). The intuition 
behind the lemma is that if Rx is the region consisting of a single node fx then p-{X) is 
maximised by avoiding the colours of sx in the rest of the boundary and otherwise spreading 
colours evenly over the boundary. 

Lemma 7 Suppose that X is a boundary pair in which Rx consists of a node fx only. 
Let V = ?> mod {q — 2) and u = [3/ {q — 2)J . (So u{q — 2) + v = 3.) 



Then 



In particular, if q > 5 



1 + A + t;A"+i + {q-2- 7;)A" ' 
1-A 



Proof. Without loss of generality, suppose Bx{sx) = 1, B'^{sx) = 2, and ci > C2- Let 
E = E{Rx) — Sx, noting that |£^| < 3. Let rij be the number of edges in E that are assigned 
spin i by Bx- Note that Cj = A"% so the constraint ci > C2 just says ^2 > ni. 

Now we wish to choose Bx in order to maximise p{X), or, equivalently, to minimise 



(l-A)ci- 

First note that ni = since Z can be reduced by recolouring edges coloured 1 with 
colour 2. Thus ci = 1. 

Now we want to set n2 , . . . , in order to minimise c = A"^ + • • • + A"' , where na + • • • + < 
3. Since A < 1, we want to take ^3 + • • • + = 3. 

Next, note that there is an optimal solution in which all nj and are within 1 of each 
other. To see this, consider a solution with Uj > n^ + l. The boundary obtained by reassigning 
one of the j edges with spin k has a c-value which is at least as small, since the new c-value 
minus the old one is 

_A"^ _ + ^n.-i + ^nfe+i = (1 _ A)(A"^-i - A"^-) < 0. 

So the optimum value of c is vX^~^^ + {q — 2 — v)X^, which gives the first part of the lemma. 

To derive the bound for ^ > 5 note that for q > 6 we have u = and v = 3. For g = 5 we 
have u = 1 and v = 0. Both of these give the same bound. □ 

We now turn to the proof of Theorem |2 The cases (g > 7), (g = 7, A = 0) and {q = 
6, A = 0) follow from previous work (see Proposition^. For each of the remaining cases we 
will use LemmaI7|to show that if X is a size-1 boundary pair then < 1/3. This implies 
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by Lemma El that every boundary pair X satisfies ^{X) < 1/3 and there is an e > so that 
every boundary pair X satisfies 

y{X)<{l-e)\. 

By induction on d (see Lemma 18 of jH]), we get Tfi{X) < (1 — e)'^ . Hence, by Lemma^we 
have strong spatial mixing (and the theorem is proved). 

We now consider the remaining cases. The second part of Lemma [3 apphes for g > 5 
where q — 4(1 — A) > 3(1 — A), i.e., A > 1 — q/7. This finishes the cases with g > 5. 

For g = 4 we use the first part of Lemma |3 with u = 1 and v = 1 and for g' = 3 we use 
the first part of Lemma [7| with n = 3 and v = 

Remark. Lemmad applies to the Potts model in a more general setting than the one consid- 
ered in this paper. In particular, it applies to the Potts model on a general graph with maxi- 
mum degree A. In the generalised version, the "3" in the definition of v and u becomes "A— 1" . 
The final part of the lemma applies when g' > A -|- 1. It gives ^{X) < (1 — A)/(g — A(l — A)), 
so, for example, we get the following result, which is slightly better than the condition derived 
by Salas and Sokal and discussed in Section EH 

Theorem 8 Consider the anti-ferromagnetic Potts model on a graph G with maximum de- 
gree A with parameters q and A < 1. There is strong spatial mixing if q > {1 — A)(2A — 1). 



4 Proof of Theorem [31 for q = 6 and positive A 

We will prove strong spatial mixing for g = 6 and A > ^ by showing that there is an e > 
such that, for every boundary pair X and every d> 1, Td{X) < (1 - e) . Then we apply 
Lemma m Following Goldberg, Martin and Paterson [S], we will consider the geometry of the 
lattice to derive a system of recurrences whose solution gives the desired bound. 

We start by considering some particular boundary pairs. In particular, we will be inter- 
ested in a boundary pair X such that Rx is one of the seven regions Qi, Q2, Qs, Qa, Q5, Qe, 
and Q7 depicted below. As before, we denote vertices as squares in the diagrams and sx is 
the edge between fx and its lower neighbour. This edge is marked with a short line segment. 



Qi 



Q2 



Q3 



Qa 



Q5 



Qt 















fx 













fx 













fx 













fx 











fx 











fx 





Lemma 9 Suppose q = 6 and X e (0, 1]. Let pi = 41/118, p2 = 179/501, p3 = 79/216, p4 = 
75/202, p5 = 49/129, pe = 27/71 and pr = 3/7. Define qi = pi + 6 for i e {1, . . . ,7} where 
5 = 1/1000. Suppose X is a boundary pair with region Rx = Qi above. Then ^{X) < qi. 



Proof. The lemma is proved by computation. For each region Qi we have considered every 
boundary pair X which has Rx = Qi- Each such boundary pair consists of a pair {Bx, B'^) 
of boundary configurations which differ only on the edge sx, obeying the requirements in 

*The same proof technique applies to the A = case. However we exclude A = because the result is 
already known ^ and excluding A = simplifies our presentation. 
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Definition 12 For each such boundary pair, we calculated a rational function in A, fj,xi^), 
which gives an upper bound on for any particular value of A. The polynomials in the 

numerator and denominator of have integer coefficients. In order to find an upper 

bound on for A G (0, 1], we partitioned the interval [0, 1] into smaller intervals [a,b]. 

We then computed an upper bound for Hx{^) for A G [a,b] by taking A = a for negative 
terms in the numerator and X = b for positive terms in the numerator. All terms in the 
denominator are positive so we use A = a. This computation was carried out exactly with 
no approximations. Working through all boundary pairs X and an appropriate collection of 
intervals [a, b] we established the upper bounds given in the lemma. □ 

Remark. The value pi defined in the statement of Lemma |H1 is defined by 

Pi = ^max nx{0). 

X:Kx=Qi 

is not monotonic in A in general. A simple non-monotonic example is the boundary 
pair consisting of a size-1 region with boundary 1,2,2 where sx is assigned spins 1 and 2. 
For this boundary pair, ci = A, C2 = A^ and C3 = C4 = C5 = cg = 1 so 



(1 - A)ci 



(l-A)A 



(l + A)ci + c (l + A)A + 4' 

Nevertheless, maxx:Rx=Qi fJ'xW seems to be monotonically decreasing in A. 

We now define some sets V, W, U, T, S, R of boundary pairs X. The sets depend only 
on the region Rx and the edge sx, but not on the boundary configurations Bx and B'j^. The 
following diagram illustrates the sets. 



V 




w 






fx 
-1— ' 




fx 




X 




X 


-1- 


X 



u 



R 





X 

fx 
-1— ' 


X 











X 


fx 




X 


fx 


X 


fx 
'— H 


X 



A crossed out square represents a vertex that is not in the region Rx- Squares that are not 
drawn represent vertices that are either in, or not in, the region Rx- As before, the edge sx is 
marked with a short line segment. The diagrams may be rotated according to the symmetries 
of Z^. For example, a boundary pair X belongs to the set R if at least two of the neighbours 
of fx are not in Rx- A boundary pair X belongs to the set U if the left or right neighbour 
of fx (or both) is not in Rx- Obviously these sets are not disjoint. 

We will now define some recurrences. Let F^ denote the maximum, over boundary pairs X, 
of Tfi{X)- Let Vd denote the maximum of F^(X) over boundary pairs X £ V and we use 
similar notation for the other sets. 

Consider a boundary pair X. We will consider six cases below. Every boundary pair is 
covered by exactly one of the cases (up to symmetry). In the diagrams, an empty square 
represents a vertex in the region Rx- As before, a crossed out square represents a vertex not 
in Rx, and all other vertices can be either in Rx or not in Rx- 
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fx 









X 




fx 





Ml 













X 




fx 




X 



X 




X 




fx 







X 






fx 





To see that the cases cover all boundary pairs, note the that the left-most four diagrams 
cover all cases in which all three neighbours of fx are present. The lower central diagram 
applies if neither of the diagonal vertices is present in Rx- The diagram above that applies 
if just one of the diagonal vertices is present. The two diagrams to the left apply if both of 
the diagonal vertices are present. 

We now add an inequality below each diagram giving an upper bound on T^^X) for d >2 
when X is a boundary pair covered by the corresponding case. The inequality arises by 
considering the boundary pairs corresponding to the children of X in the tree Tx- The values 
qi-qj are from Lemma I^J 















fx 









X 




fx 















X 




fx 




X 



X 




X 




fx 





Td{X) < q2iTd-i + 2Trf_i) Td{X) < qQ{2Sd^i + Rd^ 





X 






fx 





Td{X) < q7{2Wd^ 



For example, consider a boundary pair X covered by the lower centre diagram. We will now 
show how to prove Td{X) < qQ{2Sd-i + Rd-i)- In the construction of Tx, for every pair of 
spins c £ Q, c' £ Q, c ^ c' we introduce a child r^^c' of the root r. We construct three boundary 
pairs Xi{c, c') (where the new distinguished edge goes left from fx), X2{c, c') (where the new 
distinguished edge goes up from fx) and X-s{c,c') (where the new distinguished edge goes 
right from fx)- The boundary pair Xi(c,c') is in S (this can be verified by consulting the 
diagram corresponding to S above), so Td-i{Xi{c, c')) < Sd^i- Similarly, ^3(0, c') € S, so 
Td^i{X3{c,c')) < Sd~i- Finally, ^2(0, c') G R (this can be verified by consulting the diagram 
corresponding to R above), so ^2(0, c') < Rd~i- Since i^{X) is the sum of the probabilities 
pxic,c'), we conclude that Td{X) < i^{X){2Sd-i + Rd-i)- Now we apply Lemma |H1 and 
LemmaElto get I'iX) < n{X) < q^. Thus, we have shown Td{X) < qQ{2Sd-i + Rd-i)- The 
other inequalities are derived similarly. 
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Putting all six cases together, we get the following inequality for d >2. 



Fd < max(gi(rrf_i + 2Vd^i), 
q2{rd-i+2Td-i), 
QAiVd-i + Ud-1 + Sd-i), 
QeC^Sd-i + Rd-i), 
Ud, 

q7{2Wd-i)). (4) 

By re-considering similar scenarios with the additional assumption that X £ V we derive 
a corresponding upper bound for Vd- The following cases cover all boundary pairs in V. 























fx 








fx 






X 


1 1 


X 



Ml 








X 




fx 




X 





X 




X 




X 






fx 






fx 




X 


1 


X 





Vd{X) < q4{2Ud-i + Sd-i) Td{X) < q^iSd-i + 2Rd-i) TdiX) < q^{Sd-i + Wd-i) 







X 




fx 




1 


X 



Td{X) < qi{Vd-i + Ud-i + Rd~ 



Putting these together, we get this inequality for d>2. 

Vd < max(gi(C/rf_i + P^-i + Vd-i), 
q2{Rd~i + Pd-i + Td-i), 
q4{2Ud-i+Sd-i), 
qeiSd-i + 2i?d_i), 
q4{Vd-i + Ud^i+Rd~i), 
Ud, 

qriSd^i + Wd^i)). (5) 

In a similar manner we can find an upper bound for Wd, and the following cases cover the 
boundary pairs in W. 



16 











fx 




X 


— 1- 


X 





X 






fx 




X 


— 1- 


X 



TdiX)<qei2Ud^i+Td^i] 



These cases give the following inequality for d > 2. 

Wd < max((/6(2C/rf_i + r^^i), Ud, g7(25d_i)). (6) 
The following cases cover all boundary pairs in U, so we can find an upper bound for Ud- 











fx 


X 



TdiX) < qsi2V, 



d-l, 



X 








fx 


X 



TdiX) < q^iVd^i + Ud^i) TdiX) < qj{Sd-i + Ud^i] 



X 




fx 


X 





X 


fx 


X 



Td{X) < Rd 



These give an upper bound on Ud for d > 2. 

Ud < ma,yi{q3{2Vd-i),q5{Vd-i + Ud-i), qriSd-i + Ud-i),Rd)- 
The following cases illustrate the situation for boundary pairs in S. 



(7) 



X 


fx 


X 


X 














fx 


X 


X 





X 








fx 


X 


X 





Td{X) < Rd 



TdiX) < q^iUd-i + Vd-i) TdiX) < qyiRd-i + Sd-i) 



These give the following inequality for d > 2. 

Sd < maxiRd, q^iUd^i + Vd^i),q7iRd^i + Sd^i)). 



(8) 



Now we derive a corresponding upper bound for Td- The following cases cover all boundary 
pairs in T (apart from those in R). 





X 






fx 




1 


X 





X 






fx 




1 






fx 



TdiX)<q7iWd-i+Sd-i) 



TdiX) < qriWd-i) 



TdiX) < qriSd-i] 
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These give the fohowing inequahty for d > 2. 

Trf < max(i?d, qjiWd-i + Sd-i)). (9) 

Finally, we derive an upper bound for R^. The following cases cover all boundary pairs 
in R. Notice that the middle diagram below does not exactly match the set Qj, but clearly 
we can use the value of to bound ^t{X) also for this case. 










X 


fx 


X 




d-l, 



TdiX) = for fi > 2 



These give the following inequality for d > 2. 

Rd<max{0,q7Wd-i). (10) 

We now set e = 1/1000 and show that for every (i > 1, < (1 — e)'^. We define some 
rational numbers. Let u = s = t = r = 7/10 and v = w = 92/100. We will prove by induction 
on d that Frf < (1 - ef, Vd < v{l - ef, Wd < w;(l - ef, Ud < u{l - ef, Sd < s(l - e)^ 
Td < t{l - ef, and Rd < r(l - ef. 

The base case is d = 1. For any boundary pair X we have Ti(X) < I'iX) < ^t{X) and 
from Lemma 13 

'•(•^' s 6 -4a -A) s 5- 

The base case then follows from the fact that 

- < min(l, V, w, u, s, t, r)(l — e). 

The inductive step follows from the Equations ©, @, ©, 0, (©, ® and (fTU]) . 

First, we use Inequality fTU]) . the facts that r > and e < 1 (so < r(l — e)"^), and the 
fact that qyw < r(l — e) to show Rd < r(l — e)'^. Similarly, we use the inductive hypothesis. 
Inequality © and the facts that r < t and q7{w + s) < t{l — e) to show Td < t{l — s)'^. 

Next, we establish upper bounds on Sd and Ud- To show Sd < s(l — e)*^, we use the 
inductive hypothesis and Inequality together with the upper bound Rd < ^(1 — e)'^ and the 
following facts: r < s, q^^u+v) < s(l— e), and g7(r+s) < s{l—e). To show Ud < u{l — e)'^ , we 
use the inductive hypothesis and Inequality ((TJ together with the upper bound Rd < r(l — e)'^ 
and the following facts: r < u, q32v < (1 — e)n, ^5(1; + u) < (1 — e)n, and (77(5 + n) < (1 — e)u. 

Finally, we establish upper bounds on Wd, Vd and F^. All of these bounds use the 
inductive hypothesis and the upper bound Ud < u{l — e)'^ along with u < w, u < v and 
n < 1. To establish Wd < w{l — e)'^, we use Inequality along with the following facts: 
g6(2n + 1) < (1 — e)w and qj2s < (1 — £)w. To establish Vd < v{l — e)'^, we use Inequality © 
along with the following facts: 

qi{u + l+v) <v{l-e), 
q2{r+l + t) <v{l-e), 

qA{2u + s) < v(l -e), 

q&{s + 2r) <v{l-e), 
qi{v + u + r) < v{l — e), 
qj{s + w) < v{l — e). 
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Finally, to establish < (1 — e) , we use Inequality ((IJ) along with the following facts: 

qi{l + 2v) <l-e, 
<?2(l + 2i) <l-e, 
qi{v + U + s) <l - e, 
%(2s + r) < 

q7{2w) <l-e. 

This concludes the proof of Theorem 01 for (7 = 6. 

5 Proof of Theorem [31 for g = 5, g = 4 and g = 3 

The proof is the same as the proof for (7 = 6 in Section |1] except that for each value of q, we 
compute new values for qi, . . . ,qf (as in Lemma [HI • To find sufficiently small values we need 
to constrain the value of A. If A is too small the values of gi, . . . , gy get too large. We do not 
repeat the values of A already covered by Theorem [21 

Lemma 10 Suppose q = 5 and X e [0.127,0.286]. Let pi = 7/20, p2 = 9/25, p3 = 19/50, 
P4 = 2/5, P5 = 2/5, pq = 2/5 and pj = 1/2. Define q-i = pi + 5 for i E {1, . . . , 7} where 

5 = 1/1000. Suppose X is a boundary pair with region Rx = Qi- Then fi{X) < qi. 

Lemma 11 Suppose q = 4 and X e [0.262,0.373]. Let pi = 7/20, p2 = 19/50, ps = 19/50, 
P4 = 19/50, P5 = 2/5, pe = 19/50 and pj = 1/2. Define qi = Pi + 5 for i G {1, . . . , 7} where 

6 = 1/1000. Suppose X is a boundary pair with region Rx = Qi- Then ^{X) < q^. 

Lemma 12 Suppose q = 3 and X e [0.393,0.474]. Let pi = 873/2500, P2 = 9/25, ps = 
48/125, p4 = 9/25, ps = 39/100, pe = 37/100 and pj = 1/2. Define qi = Pi + 6 for 
i G {1, . . . , 7} where 6 = 1/1000. Suppose X is a boundary pair with region Rx = Qi- Then 
f^{X) < qi- 

Remark. Unlike Lemma El the values of pi in the lemmas above are strict upper bounds on 
maxx:_Rjf =Q, ^x(A), where A is the smallest value in the specified intervals above. Writing 
the exact values of mayiX:Rx=Qi t^x{X) would require many more digits. Again, these values 
seem to be monotonically decreasing in A. 

We use computation in the same manner as for the proof of LemmaElto prove these lemmas. 
Following the proof of the q = 6 case of the theorem and using the values of qi in the 
lemmas above, we can then define new rational numbers v, w, u, t, s and r, and prove 
TdiX) < (1 - ef. 
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